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Abstract 

I study the quantum mechanics of a spin interacting with an "apparatus" . 
Although the evolution of the whole system is unitary, the spin evolution is 
not. The system is chosen so that the spin exhibits loss of quantum co- 
herence, or "wavefunction collapse", of the sort usually associated with a 
quantum measurement. The system is analyzed from the point of view of 
the spin density matrix (or "Schmidt paths"), and also using the consis- 
tent histories approach. These two points of view are contrasted with each 
other. Connections between the results and the form of the Hamiltonian are 
discussed in detail. 



1 Introduction 

A cosmologist must face the the issue of interpreting quantum mechanics 
without the benefit of an outside classical observer. By definition, there is 
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nothing "outside" the universe! The traditional role of an outside classical 
observer is to cause "wavefunction collapse". This process causes a definite 
outcome of a quantum measurement to be realized, with the probability for 
a given outcome determined by the initial wavefunction of the system being 
measured. It is common to view this process as something the can not be 
described by a wavefunction evolving according to a Schrodinger equation, 
but which instead must be implemented "by hand" . 

There is a growing understanding |], @, |3|, H, H, B S H> that the 
essential features of wavefunction collapse can be present in systems whose 
evolution is entirely unitary. The key is the inclusion of an "environment" or 
"apparatus" within the Hilbert space being studied. A subsystem can then 
exhibit the non-unitary aspects of wavefunction collapse even though the sys- 
tem as a whole evolves unitarily. The wavefunction can then divide up into a 
number of different terms, each of which reflect a different "outcome". When 
there is negligible interference among the different terms during subsequent 
evolution, the "definiteness" of the outcome is realized in a restricted sense: 
Each term evolves as if the others were "not there", so a subsystem state 
within a given term evolves with "certainty" that its corresponding outcome 
is the only one. None the less, the total wavefunction describes all possible 
outcomes, and one is never singled out. 

Some people object to all the "extra baggage" or "many worlds" JTTJ] which 
result from retaining all possible outcomes. However, this approach has 
the advantage of allowing quantum mechanics to operate in a much more 
fundamental way, and predict which subsystems can play the role of classical 
observers. Unless the predictions are falsified, this approach can never be 
shown to be wrong. 

In what follows I present a simple toy system which is designed to il- 
lustrate the essential features of a quantum measurement. A very primitive 
"apparatus" is coupled to a two state "spin". The whole spin-apparatus 
system evolves unitarily and remains in a pure state, even as the two sub- 
systems exhibit the non- unitary evolution associated with the measurement. 
Both the "consistent histories" point of view, and a more conventional point 
of view (using reduced density matrices or "Schmidt paths" ) are use to ana- 
lyze the same process. The connections between the two points of view are 
discussed. 

This paper is an expanded version of a talk presented at the "Workshop 
on time asymmetry" in Mazagon Spain ||12|. The results are the same, but in 
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this paper I describe the Hamiltonian, and explore in detail the relationship 
between the results and my choice of Hamiltonian. The purpose is to develop 
some intuition as to what attributes make a "good classical observer". I 
also discuss the relationship between the "consistent histories" and more 
traditional approaches in more detail. 

The organization of this paper is as follow: Section 2 presents some math- 
ematical tools. Section 3 introduces the toy model and illustrates what it has 
to do with a quantum measurement by analogy with the double slit exper- 
iment. Section 4 shows the behavior of the toy model in more quantitative 
detail. Section 5 introduces the "consistent histories" point of view, and 
section 6 applies this point of view to the toy model. Section 7 explains 
specifically how the form of the Hamiltonian allows the density matrix evo- 
lution described in section 4 to be achieved. Section 8 explains how the form 
of the Hamiltonian allows the consistent histories described in Section 6 to 
be achieved. Section 9 discusses the relationship between the Schmidt paths 
and consistent histories. Section 10 explores the fundamental role played by 
the statistical "arrow of time" in the processes under study. Conclusions are 
presented in Section 11, and a number of technical results are presented in 
the Appendices. Units in which h — 1 are used throughout. 



The focus of this paper is the evolution of the spin and apparatus subsystems 
from pure into mixed states due to correlations being set up between the 
subsystems. The "Schmidt decomposition" provides a useful tool for dealing 
with these issues, and it will be used throughout this paper. 

If a closed system in a pure state \vp) is divided into two subsystems, one 
might want to think about \ip)'s of the form 



One could then say subsystem 1 is in the pure state iV^ij an d subsystem 2 
is in the pure state |^) 2 . However, this "direct product" form for \ip) is far 
from general. A general \ip) would look like: 



2 Tools 




(i) 




(2) 
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where and { | J ) 2 } span the two respective subspaces. Then one can 

not talk about pure states for subsystems 1 and 2. However, one can say 
something along these lines if \ip) takes the special form: 

= I>i|z)i <g> |i) 2 ( 3 ) 

i 

for some orthonormal sets and {l^}- This form is special because there 
is only one summation, and each state for system 1 is uniquely correlated 
with a specific state for system 2. The reduced density matrix of system 2 is 
then 

p 2 = \r l {\^){^\) =J2 a-aS) 22(t\ (4) 

i 

which is diagonal in the {1^)2} basis. The result is that the probability 
assigned to any state \x) 2 of the spin is 

(x\p 2 \x) = J2a*ai\(x\i) 2 \ 2 , (5) 

i 

which is an incoherent sum over the probabilities of each state \i) 2 , weighted 
by the probability a*ai assigned to that particular state. 

One can thus regard system 2 to be in state \i) 2 with probability a*a>i. 
Although quantum mechanics allows one to assign probabilities for the spin 
to be in any state, the basis in which p 2 is diagonal is special, because only 
in that basis does any matrix element of p 2 take the form of an incoherent 
sum as depicted in Eq (|]) (with no interference terms like {x\i) 2 dj\x)) . 

It turn out that the "special form" of Eq (|^) can always be realized. It 
is called "Schmidt" form, and follows directly from the fact that any density 
matrix can be diagonalized. The Schmidt bases, {|«)f} and {|z)f}, are the 
eigenstates of the reduced density matrices p\ and p 2 , and oti = ^/pl, where 
Pi are the eigenvalues (pi and p 2 have the same eigenvalues, and the larger 
one has additional zero eigenvalues). For more discussion of the Schmidt 
decomposition see |TB|, [TJ|, [I], |I5L [TJ], [T7|, Ref [Tj| contains the original 



mathematical result, and a brief proof is offered in 

The Schmidt decomposition allows one to expose precisely which corre- 
lations are present between two subsystems. The special form of Eq |] shows 
that state |l)i is uniquely correlated with state 1 1 ) 2 and so on. It also allows 
one to make the clearest possible statement of the "state of a subsystem", 
by providing the eigenstates and eigenvalues of the relevant reduced density 
matrix. 
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3 The toy model 



3.1 Defining the model 

The toy model discussed here is two state "spin" (system 2) coupled to a 
modest sized "apparatus" (system 1). The Hamiltonian is the same one used 
in HT31, which takes the form 



H = H 1 ®I 2 + I 1 ®H 2 + H I (6) 

where Ik represents the unit operator in the space of subsystem k. The first 
two terms give self Hamiltonians of the apparatus and spin respectively, and 
the last term gives the interaction between spin and apparatus. 

For this article I have chosen the parameters so that the interaction Hamil- 
tonian dominates over the self-Hamiltonians of the two subsystems. (Specif- 
ically Ei — E 2 — 0.1 and Ej = 10 in the notation of |18| .) The size of the 



system 1 is n\ = 25 here, as opposed to n\ = 12 in fl8 
The interaction Hamiltonian is: 



H I = E I (\]){]\®H\ + \i){i\®H{) (7) 



T I 

where H{ and H{ are two different random Hermitian matrices in the system 
1 subspace. (Each independent real and imaginary part of each element of 
H\ and H\ is chosen randomly on the interval [—0.5,0.5).) The random 
matrices are chosen once and for all at the start of the calculation, so Hj 
is time independent. For our purposes, the role of the self-Hamiltonians for 
the subsystems can be described by the statement "the deviation of the total 
Hamiltonian from Hi is very small". For more details see |L8] (A similar 
model is used in [[HI). 

The idea behind the form of Hi is very simple: If the spin is up the 
apparatus is pushed in one direction by H\ and if the spin is down, the 
apparatus is pushed in a very different direction by H\. The goal is to 
correlate different states in the apparatus with the ||) and ||) states for the 
spin. 
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3.2 The purpose of the model 

The toy model is designed to perform a very specific function: The model 
should take an initial state of the form 

|^> = (a|T) 2 + &|j) 2 )®P0i (8) 
and evolve it into the state: 

IVy> = a|T> 2 ®|y>i + &U>2®l^>i ( 9 ) 

Where (Y\Z) = 0. Actual numerical results showing this evolution are pre- 
sented in Section [|, and a detailed analysis of why this model is able to 
achieve these results appears in Sections [7[ 

Both Eqs (f|) and @ are in Schmidt form. Thus one can see that initially 
the spin (and the apparatus) is in a "pure" state. Later, p 2 develops two 
non-zero eigenvalues, so the spin is in a mixed state. The eigenstates of the 
final spin density matrix are |f) and ||). Thus, at the end the spin is clearly 
no longer in the a\]) 2 + b \{) 2 state, but it may be said to be in an incoherent 
superposition of ||) and ||). An important feature is that the probabilities 
assigned to |f) and |f) at the end (a* a and b*b respectively) are the same 
as those assigned initially. The only difference is that the initial state is a 
coherent superposition of ||) and ||). The choice of |j) and |j) as the final 
eigenstates of the density matrix was built into the dynamics (and the choice 
of initial state). Although the evolution of the spin is non-unitary (since the 
eigenvalues of its density matrix change), the evolution of the total spin plus 
apparatus system is chosen to be completely unitary. 

3.3 Analogy with the double slit 

What does this have to do with wavefunction collapse? One might expect 
a parallel description of the standard double slit experiment: After passing 
through a double slit, an electron wave packet becomes spread out into a 
distinctive double slit diffraction pattern. At this point the electron is still 
in a pure state, and it is at this point that I wish to make the analogy with 
Eq [5], the initial state for the toy system. After interacting with a screen, the 
electron is certainly not in a pure state, but the electron may be expressed as 
an incoherent superposition of localized packets. The probability assigned to 
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each packet is the same probability assigned to that location by the original 
pure electron state. (Extremely low probabilities are assigned at nodes of the 
double slit pattern, for example.) The loss of coherence of the initial state 
is due to the setting up of correlations between the electron and the screen. 
The screen plays the role of system 1 in equation (|j) (of course there would 
be more than two terms in the Schmidt expansion of the electron-screen 
system) . 

For each localized packet the screen is in a different orthogonal state. The 
extent to which the electron density matrix eigenstates tend to be localized 
packets rather than some other types of states is determined by the nature of 
the interaction, and the initial state of the screen. It is natural to expect the 
eigenstates to be localized, due to the local nature of interactions. Because 
of the correlation between the screen and the electron, one can determine 
the state of the electron by measuring the state of the screen. In fact, one 
normally does look at the screen, not at the electron. 

There are three key feature of the double slit experiment which are present 
in the toy system. First, the density matrix eigenstates (or Schmidt states) 
take a particular form after the measurement which is determined by the 
interactions. This "pointer basis" (||) is {|T)?|I)} i n the toy model, and 
fairly localized wave packet states for the electron in the double slit example. 

Second, the probability assigned to each density matrix eigenstate after 
the measurement corresponds to the same probability assigned to that state 
in the pre-interaction pure state. For the spin, this results because the coef- 
ficients "a" and "6" are the same in Eqns |8] and |9|. For the double slit case, 
the diffraction pattern is represented in the distribution of density matrix 
eigenvalues after the interaction. (This is why, after many electrons strike 
the screen, the diffraction pattern is produced.) 

Third, it is very unlikely that the process will reverse itself. For the 
double slit, it is extremely unlikely that the screen will emit an electron in a 
double slit diffraction pattern. The reason why the toy model is unlikely to 
evolve from Eq |9] back to Eq § will be discussed in Section |7|. 

One way the analogy does not work is in the details of the apparatus. The 
apparatus in the toy model is much less sophisticated than a realistic screen. 
Although the different "outcomes" of the measurement do get correlated with 
orthogonal states of the apparatus, the apparatus states do not represent a 
nice "pointer" or "mark on a screen" which clearly reflects the state of the 
quantity being measured. 
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4 Results 



Figure 1 shows information about the spin as the whole system evolves. 
Initially, the state is given by Eq (H), with a = 0.7, b = 0.3. In the lower plot, 
the solid curve gives pi, the largest eigenvalue of p%. It starts out at unity, 
as required by the "pure state" form of the initial conditions, and evolves 
to 0.7, where it holds steady. The dashed curve gives the entropy, S, of the 
spin (S = — ir[p2log2{p2)\) , in units where the maximum possible entropy in 
unity. The entropy starts out zero and increases. This is always the case 
when a system evolves from a pure to a mixed state. (Note the the combined 
"spin eg) apparatus" system remains in a pure state, so its entropy is zero) 

In the upper plot, the dashed curve gives the overall probability for the 
spin to be up, given by (T| P2 IT)- This quantity is a "constant of the motion". 
The solid curve gives | (| |l) s | 2 , where \ l) s is the eigenstate of P2 (or "Schmidt 
state") corresponding to the largest eigenvalue. 

Since |1) 5 belong to a two state Hilbert space, it is completely specified 
by |(| |l) s | 2 , up to an overall phase. One can see that as the eigenvalue {pi) 
approaches 0.7, the eigenvector becomes essentially ||). Thus the behavior 
promised in the previous section (Eqs (§) and @) is realized to a good 
accuracy. 

Figure 2 is another representation of the way the eigenstates of p2 evolve. 
The first row represents and the second row represents the other eigen- 
vector. The three columns correspond to three times. The histogram in each 
plot provides two numbers, p(|) = |(T |1)| 2 an d p(|) = \{[ |1)| 2 for the first 
row, and similarly for the second eigenvector in the second row. In this way 
one can visualize a "collapsing wavefunction" by following the eigenstates of 
p 2 as they "collapse" onto the {|T) > I!)} basis. 

5 Consistent Histories 

I will now make contact with the "consistent histories" or "decoherence func- 
tional" approach to quantum mechanics of closed systems. Until now I have 
been using the wavefunction to assign instantaneous probabilities to different 
states over a range of times. By contrast, the consistent histories endeavors 
to assign probabilities to histories. Consider two projection operators: 

P T = |TXTI®/i; A = \i){i\®h (10) 
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where I\ is the identity operator in the apparatus subspace, and {||), |J,)} 
form an orthonormal "projection basis" which spans the spin subspace. These 
projection operators sum to unity: 

A+iW. (11) 
One can take the formal expression for the time evolution: 

m)) = e- iHt \m) (12) 

and insert the unit operator (Pj + Pj_) at will, resulting, for example, in the 
identity: 

\m) = (A+A)e-^ ti) (A + A)^ mi i^(o)) (is) 

= P^- iH{t - tl] P^e- iHtl |V(0)) + P^^-^Pie-* 11 * 1 |V(0)) 

+P l e- lH(t - tl) P 1 e- iHtl \^(0)) + P^-^-^P^-^ 1 \if>(0)) (14) 

= l[U]> + l[U]> + |[U]> + |[U]>- (15) 

The last line just defines (term by term) a shorthand notation for the previous 
line. Each term represents a particular choice of projection at each time, 
and in that sense corresponds to a particular "path". In the path integral 
formulation of quantum mechanics the time between projections is taken 
arbitrarily small, and the time evolution is viewed as a sum over microscopic 
paths. For present purposes, the time intervals can remain finite, representing 
a "coarse graining" in time. Each term in the above expression is called a 
"path projected state" , and the sum is a sum over coarse grained paths. 

One attempts to assign the probability u ([i, j] \ [i, j})" to the path [i, j], but 
to make sense, the probabilities must obey certain sum rules. For example, 
one can define 

|[T,-]) = |[T,T]> + |[U]>, (16) 

where the "•" signifies that no projection is made at t\. One would want the 
probability for the path [j, ■] to be the sum of the probabilities of the two 
paths of which it is composed: 

([T, -]|[T> ■]) = ([t, T]|[T> T]> + ([T> l]|[T> I]) (17) 

However, one can "square" Eq ([16]) to give the general result: 

([T,-]|[T,-]> = <[T,T]|[T,T]> + <[T4]|[T4]> + <[T,T]|[T4]> + <[T4]|[T,T]> (18) 
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Table la Table lb 



path 


value 


path 


value 


< [T T] 1 [T T] > 


0.70 


<[JJ]|[JJ]> 


0.74 


([U]|[U]> 


0.00 


([XT]|[XT]) 


0.03 


(IT -Hit ■]) 


0.70 


mm) 


0.61 


% violation 


0% 


% violation 


25% 



Table 1: Testing the probability sum rule (Eq (15)) for different paths. For 
la the sum rules are obeyed for any choice of ti and t. For lb, t\ = .035 and 
t = 0.06 



Only if the last two terms (the cross-terms) in Eq ([18]) are small is the sum 
rule (Eq ([i~7])) obeyed. When the relevant sum rules are obeyed the paths 
are said to give "consistent" or "decohering" histories. Advocates of this 
point of view argue that the only objects in quantum mechanics which make 
physical sense are sets of consistent histories. For a discussion of how this 
simple example links up with the (much more general) original work on this 
subject ( |20], [21], |22|, \ZQ \Z§) see fT8|. Other work on the consistent histories 



approach includes [25, |2q 



6 Testing for consistent histories 

Table la checks the probability sum rule (Eq fll7D ) for the toy model whose 
evolution is depicted in Fig 1. The projection times are t\ = .15, t = .2, and 
the projection basis is {||) , ||)}. The sum rule is obeyed to the accuracy 
shown. In fact, using the {|f ) , projection basis, the sum rule is obeyed no 
matter which projection times are chosen and how frequently the projections 
are made. 

This result came as a surprise to me. After all the interesting dynamics 
described in Figs 1 and 2, the consistent histories approach offers a com- 
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pletely static perspective. The constant "|" and "|" paths are consistent 
right through the period when the correlations are being set up. 

One of the very interesting features of the consistent histories point of 
view is that typically there are many different sets of consistent histories. It 
turns out that for this particular example some sets of consistent histories 
reflect the "quantum measurement" more directly. 

Consider for a moment a static (Hamiltonian = 0) spin, not coupled to any 
apparatus. It turns out that as long as the same projection basis is chosen at 
t and ti, one always gets consistent histories. This is true for any projection 
basis. One could choose {|T) > II)} or one could choose the projection basis 
{|X), | _L)}, where |X) is the initial state of the spin (a||) 2 + &||) 2 ), and 
| _L) is the state orthogonal to it. A static spin would naturally result in 
unit probability for the [X, X] path, and zero probability for all other paths. 
Table lb shows the results for the fully interacting spin, using the {|X), | _L)} 
projection basis, but otherwise the same as Table la. Clearly the sum rules 
are not obeyed in this case. 

When {|T) ) II)} was used as a projection basis, there was no difference, 
from the consistent histories point of view, whether the interactions between 
spin and apparatus were present or not. Consistent histories resulted in 
either case. When the {|X), | _L)} projection basis was used, the effects of 
the interactions were evident: Only in the absence of interactions were those 
histories consistent. 



Gell-Mann and Hartle |24], g7[ have emphasized the important role that 



"records" or correlations among subsystems can have in producing consistent 
histories. In |18| I noted that since the Schmidt decomposition gives an exact 
account of whatever correlations are present, the Schmidt states (eigenstates 
of the reduced density matrix) often make a very good choice of projection 
basis. 

Indeed, I have found the following types of histories are always consistent 
for this toy system: For the first projection time one chooses the eigenstates 
of p 2 (the Schmidt states) as the projection basis. At the second projection 
time one chooses the {|T) > II)} projection basis. These paths are consistent 
for any choices of the two projection times. These paths certainly reflect the 
measurement process, since (as shown in Figs 1 and 2) this process shows 
up quite explicitly in the behavior of the Schmidt states. One can expand 
on this set of paths by including additional projections on the {|T) > II)} 
basis. However, one will not get consistent histories if one makes additional 
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projections on the Schmidt basis (until after it coincides with the {| j) , ||)} 
basis). Thus the actual picture presented by any of these sets of paths is 
quite different from the Schmidt paths depicted in Fig 1. 

7 Time evolution and the density matrix 

This section, and the one which follows, are devoted to describing how the 
Hamiltonian which governs the toy system is related to the results presented 
above. 

To explore the effect of Hj, note that the initial state (Eq H) can be 
written: 

|^) = a|T)®|X) 1 + 6|i> (S> |Jf> x . (19) 
Under time evolution according to Hj, this state maintains a similar form: 

|V(t)> = a |t) ® \Xi(t))t + b ||) ® \X l (t)) 1 . (20) 

Where |Xf(t))i and \Xi(t))i are the initial apparatus state \X) evolved for- 
ward in time according to H\ and H\ respectively. Because H\ and H[ 
are different randomly chosen operators, on average the states \X^{t))\ and 
\Xi{t))\ have no more overlap than two randomly chosen vectors in the ap- 
paratus subspace. For sufficiently large apparatus subspaces, the overlap will 
be extremely small. 

From this point of view the initial conditions, where (Xj (0)|Xj(0)) = 1, 
are very special. As time evolves the value of (X|-(0)|Xj(0)) decreases to 
its more natural small value. If (X|-(0)|Xj(0)) were to become close to unity 
later in time, this would correspond to the apparatus "forgetting" its records, 
analogous to the screen re-emitting an electron in a diffraction pattern state. 
In the toy model, this happens very rarely (for large apparatus) because two 
vectors evolving randomly in a large space rarely overlap. (This effect has 
been discussed at length in this context by Zurek ||.) 

Much of the earlier discussion has focused on the eigenstates of the re- 
duced density matrix for the spin (p 2 ), or Schmidt states. These states, along 
with the eigenvalues, provide the most concise description of the state of the 
spin, and they describe the correlations with the apparatus as well. The 
Schmidt states start out being very different from the the {||) , ||)} basis, 
but approach very close as the measurement is completed. The Schmidt 
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states then stabilize close to the {|T) > II)} states and do not change much 
after the measurement. 

The degree to which the Schmidt states are {|T) ? II)} can be studied by 
examining the off diagonal matrix element of p 2 in the {|T) > II)} basis: 

(l\p 2 (t)\l) = ab*(X l (t)\Mt))- (21) 

Equation pi] shows that to the extent that the overlap of X^ (t))i with \X±(t))x 
is small, {|T) > |l)} are the eigenstates of p 2 . As discussed in Appendix A, 
the typical overlap goes down as the size of the apparatus is increased, so 
even in this simple example one can see that large size is an advantage when 
building an apparatus. 



7.1 A Catch 

The case where a and b are nearly equal deserves special attention. In this 
case the eigenvalues of p 2 become nearly degenerate at late times, and the 
form of the eigenstates of p 2 becomes a delicate matter. 
Consider first the case of strict equality: 

a = b = 1/2 (22) 

In this limit (|| p 2 {t) |f) = {{\ p 2 {t) \ [) = 1/2, and the form of the eigenstates 
is completely determined by (T I P2 (^) |l)- In this special case the eigenstates 
are either 

(lt)±e"|Q) 

if (| I p 2 (t) ||) is non-zero (no matter how small! ) , or undetermined if (| | p 2 (t) 1 1 ) 
is exactly zero. (See Appendix A for the definition of the phase 9 and further 
details.) 

A physicist need never worry about the "measure zero" case where Eq |22] 
is exactly obeyed, but there is a more general point to be made: As a and 
b get close to one another, even very small values of (|| p 2 (t) |1) can be "too 
large" and cause the eigenstates of p 2 to deviate greatly from the desired ||) 
and \l) states. 

In the toy system, the mean magnitude of ('W p 2 (t) |j) is never zero, al- 
though it can be made arbitrarily small by increasing the size of the appara- 
tus. Thus for every apparatus size there exists a limit to how close a and b 
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can get without causing the Schmidt states to exhibit large deviations from 
the desired behavior. On the other hand, given any arbitrarily close values 
of a and b, there exists a sufficiently large choice of n\ so that the desired 
behavior is achieved. In Appendix A I show that the minimum value of 
\a — b\ scales as 1/ ' ■Jn\. (Note that the "ni" of a real macroscopic apparatus 
is huge!) 

I also argue in Appendix A that if one accepts the departure of (f| p2[i) \[) 
from zero as an indication of the precision of the apparatus, then there is 
nothing particularly wrong with the apparatus in the a — > b limit. The 
apparatus is just unable to precisely resolve the value of a — b. One could 
even argue that the sensitivity of the Schmidt basis to the precise value of 
(T| P2(t) ||) in this limit makes the Schmdit basis misleading when \a — b\ falls 
below the "experimental resolution" . (This amounts to a major concession 
to W. Zurek, with whom I have been having ongoing informal debates about 
the value of the Schmidt decomposition!) 

7.2 Some Red Herrings 

I came up against the special behavior discussed in Section |7.1| early in the 
course of this work. Although I appreciated the overall delicacy of the de- 
generate eigenvalue case, my efforts to preserve the desired behavior in that 
limit were not always to the point. In this subsection I critique some remarks 
on this subject in previous papers of mine. 

The eigenstates of Hj (from Eq [7]) have the form of either: 

|A/> = IT) ® |A T ) (24) 

or 

|A/> = II) ® \Xi) (25) 

where the |Af) and |Aj_) are the eigenstates of H\ and H{ respectively. The 
addition of sub-dominant "self-Hamiltonians" for the two subsystems does 
not have a large overall effect. However, frequently a handful of energy 
eigenstates deviate greatly from Eqs ^ and (The reason is that among 
the random set of energy eigenvalues there are always a few which are quite 
close together. Under such circumstances small perturbations can greatly 
affect the form of the eigenstates, as was already discussed regarding p 2 .) 
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Some of my previous efforts to produce a "good measurement" in the 
a — > b limit focused on avoiding the bad energy eigenstates, which are not 
close to Eqs and In ]Tj| (section 6.1) I further reduced the coefficients 
of the self-Hamiltonians, and in |12| I specially chose the initial conditions 
to avoid the bad energy eigenstates. In fact none of these efforts were useful, 
because they did not reduce the overlap of |Xf(t))i with \X±(t)) 1 . This 
overlap is present even when the energy eigenstates are exactly given by 
Eqs [24] and ^5], and the form of the eigenstates was not the problem which 
needed addressing. The overlap is most easily reduced in the toy system by 
increasing the the size (ni) of the apparatus. 



8 Consistent histories and the Hamiltonian 

The states ||) and ||) for the spin are absolutely stable under the action 
of Hi. Once one projects with Pj the projected state will remain of the 
form ||) (g> |something)i from then on. Projecting with Pf and Pj at different 
times is certain to give zero. If {||) , ||)} is the projection basis, then the only 
path projected states with non-zero amplitude have all the projections either 
uniformly up or uniformly down (regardless of the values and frequency of 



the projection times). The only cross-term in Eq 18 which could potentially 
cause sum rule violation is the dot product between the uniformly up and 
uniformly down path projected states. This cross-term is also zero because 
(Tl I) — 0. Thus using the {|T) ? II)} projection basis is sure to give con- 
sistent histories for this model, no matter what the initial state. Although 
these histories do not explicitly exhibit dynamics associated with the evolving 
correlations, their consistency is closely linked with these dynamics via the 
special stability of the {||) , ||)} basis. In Zurek's || language, the {|T) > II)} 
basis is a "pointer basis", which does not loose quantum coherence via the 
interactions. 

The {|T) ) II)} basis is special because of the form of Hj (Eq [7]). The 
basis {|X), | _L)} is nothing special from the point of view of the Hamiltonian, 
and it is not surprising that consistent histories were not found using that 
projection basis. 

The other sets of consistent histories discussed in Section 9 involved pro- 
jecting first on the Schmidt states and then on the {|T) , |{)} basis. After the 
first projection, the two resulting path projected states are just the two terms 
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(v/pi|l)f ® I l)f an d i/P2|2)f ® |2)f) of the Schmidt decomposition of the total 
wavefunction at t\. The subsequent evolution of each path projected state 
may be treated as in Eq |2D|: 

|[i](t)) = e-^)^(vpT|i)f®|i)f) = [T> (VpKTI i>f) ® |i T (*)>i 

+ 11) (v^U|i)?)®|ii(*)X26) 



where and |l|(t))i are |l)f evolved under H\ and H[ respectively. 

Likewise: 

|[2](t)) = e-^- t ^(V^|2)f®|2)f) = |t)(v^(t|2)f)®|2 T (0)i 

+ ||) (V^U|2) 2 5 )®i2 i (t)}427) 



Given the form of Eqs |26| and [H] it is easy to see the effect of later projecting 
on the {|t) , ||)} basis. The resulting four path projected states are just the 
two terms from Eq |26] and the two terms from |27|. The relevant cross-terms, 
which must be zero to give consistent histories are: 

<[T,1]|[T,2]> oc (l T (t)|2 T (t)> (28) 

and 

(U,l]|[i,2]><x<l;(t)|2 ; (f)> (29) 

The quantity in Eq ^ is exactly zero because |1|(£)) and |2j(t)) started 
orthogonal, and were unitarily evolved by the same Hamiltonian, so they 
must remain orthogonal. Likewise for Eq [29] . 

Unlike the first set of consistent histories discussed, these histories are 
consistent because of orthogonality of the path projected states in the ap- 
paratus subspace. One can say that records of the spin at ti are present in 
the apparatus. The Schmidt decomposition (at t\) was use to resolve these 
records. (If any other projection basis had been used at t\, the counterparts 
of |l-f(t)) and |2-|-(t)) would not have started out orthogonal, and the cross- 
terms would not have come out zero.) The choice of second projection was 
also crucial. By choosing the stable {||) , ||)} basis, one avoided loosing track 
of the records between t\ and t 2 , when the second projections were made. 
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9 Comparing consistent histories with instan- 
taneous probabilities 

The consistent histories approach involves assigning probabilities to histories. 
In contrast, the wavefunction at a particular time can be used to assign a 
probability to any state (possibly a state specified only for a subsystem). One 
just projects onto the state in question and squares to get the probability. 
This procedure can be repeated at different times (always evolving the whole 
unprojected wavefunction). The Schmidt paths just give a way of following 
the probabilities assigned to a particular set of states. Often these Schmidt 
states are very interesting because they exactly reflect the correlations which 
are present. 

The consistent histories formalism actually coincides with the instanta- 
neous probabilities view in the special case where the "paths" are defined at 
a single instant, utilizing just one projection time. In this case there is no dif- 
ference between assigning a probability to a "path" or a state. In fact, such 
probabilities automatically obey all necessary sum rules, which is why no ad- 
ditional consistency conditions are discussed when taking the instantaneous 
probabilities view. 

The consistent histories formalism allows one to go beyond the instan- 
taneous view and (at the cost of extra conditions) assign probabilities to 
extended histories. That is, histories defined over more than one moment in 



time. A number of authors (for example |25|, |30], |28| ) have attached great 
importance to this way of going beyond the instantaneous point of view. 

In general the extended histories and the instantaneous probabilities offer 
very different points of view. However, the the two can be quite similar in 
the particular case where good measurements are made within the closed 
system being treated. In that case projecting on a particular set of records 
(at a single time) should be completely equivalent to projecting (even at 
earlier times) on the corresponding state of the system being recorded, as 
long as the projection is made at a time after the measurement has been 



completed (see for example refs 27, |25[ ). Since the Schmidt decomposition 
can be applied to expose the correlations among all the relevant apparatuses 
and systems, one might expect that the paths traced out by the Schmidt 
states after a measurement should bear a lot of resemblance to one set of 
(extended) consistent histories. However, based on the work in this paper, 
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it does not seem that the extended consistent histories and the Schmidt 
paths bear much resemblance during the measurement process, when the 
correlations are actually being set up. 

Much of this paper studies the measurement process from the instanta- 



neous probabilities point of view. It has rightly been pointed out p5|, ^4 



|28| , pOfl that the information provided by a wavefunction at a single moment 
in time is of limited use in investigating many important issues in quantum 
mechanics. None the less, by following the time development of the instanta- 
neous probabilities one is able to provide some useful insights into the nature 
of the quantum measurement. (This is the used in the pioneeing work by Zeh 
and Zurek. The whole notion of Zurek's "pointer basis" or Zeh's "stability 
of the Schmidt states" |TJ is connected with time evolution, as is the issue 
of "permanence" of the record, which both these authors address. Their 
analysis, which involves the time development of instantaneous probabilities, 
is very different from just looking at a wavefunction at a single moment in 
time.) 

Like Zeh and Zurek, I have found the time development of the reduced 
density matrix to offer a convenient perspective on the measurement pro- 
cess. One can answer questions such as "how long does the measurement 
take?"|| |3T|, and "what is the state of the system half-way through the 



measurement?" (Fig 2). In turn these insights can help one deduce the fea- 
tures which make a good measurement apparatus. 

In this particular application I have not found the generalization to ex- 
tended histories offered by the consistent histories formalism particularly il- 
luminating. No single set of extended histories appeared to be following the 
correlations in any continuous way, and no set indicated the duration of the 
measurement process. No doubt these features can be extracted by consid- 
ering a large number of different sets of extended consistent histories but not 
in a particularly direct way. In short, the time development of the reduced 
density matrix seems to allow one to focus more directly on the measurement 
process, as compared with the extended histories point of view. 

This is not to say that the different focus offered by the extended histories 
is "bad" . After all, in many realistic situations one does not want to focus 
on the details of the measurement process. For example, the "constant" spin 
up and down consistent histories are probably exactly how an observer who 
measures the spin in the {||) , ||)} would want to think of the history of 
the spin. Whether the spin was once in a coherent superposition of up and 



18 



down, and whether some other system had already measured the spin in the 
{IT) 5 II)} basis before his measurement occurred would be of no practical 
interest to the observer^. 



10 The arrow of time 

As has been noted, for example by Zurek and Zeh |3B|, |T5], |3H{] , there is 
an arrow of time built into the dynamics discussed here. This is dramatized 
in Fig 3, which is identical to Fig 1, but with the time axis extended to the 
interval [—2,2]. One can seen that the pure "initial" (t = 0) state (which 
has zero entropy for the spin), is a very special state and the "collapse of 
the wavefunction" proceeds in the direction of increasing spin entropy. The 
t < part of Fig 3 illustrates an "un-collapsing" wavefunction, where the 
correlations present between spin and apparatus at early times are lost, and 
the pure state emerges at t — 0. Then, for positive values of t correlations are 
established again. The stability of these correlations (and thus the goodness 
of the measurement) depend on another such "entropy dip" not occurring for 
t > 0. In the language of Section 7, one is depending on the random evolution 
of |Xf(t))i and \X±(t))i not to cause these two states to overlap appreciably 
at later times. (This issue has been discussed at length in ||.) Even the 
simple system discussed here is complex enough for such large entropy dips to 
occur very rarely. Still, with such a small apparatus, noticeable fluctuations 
are present. (Note that the portion of Fig. 3 which is shown in Fig 1 is 
uncharacteristically well behaved. See Appendix C for further discussion.) 

Aside from questions of stability, how fundamentally is the arrow of time 
linked to quantum measurement? The initial state, has zero entropy for 
the spin, so it is not surprising that just about anything will cause the entropy 
to increase. What about starting with a more general initial state? Schmidt 
tells us that (in a suitable basis) the most general state can be written 

l^) = v^l 1 )2®|l)i + V^|2)2®|2)i. (30) 

I show in Appendix B that if one requires evolution which generalizes Eq (|]) 
to give 



tFor some related ideas see 132f| 
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= v«(T |i>2lT>®l^>i + U|i>2|i>®|£>i) 

+V ^((T |2) 2 IT) ® \C) l + (I |2) 2 |i) ® 1^)0 (31) 

then one must have increasing (or constant) entropy of the spin (— ir[p 2 ln(p2)]) 
as l^j) — > |V>/). Thus "good measurement" appears to be closely linked with 
increasing entropy, even for high entropy initial states. (Note that I have 
chosen all four apparatus states, \A)\, \B)\, \C)\, and \D)i to be mutually 
orthogonal. This means that in \ipf) the apparatus has a record of whether 
the spin is up or down, and which term of Eq (|30|) has been "measured".) 



11 Conclusions 

The ideas put forward by Zeh ]l], Zurek ||, Joos and Zeh [35], and Unruh 



and Zurek @, have sufficiently de- mystified the notion of wavefunction col- 
lapse that one can actually unitarily follow the evolution of a system right 
through the collapse process. I have investigated a simple system which ex- 
hibits "wavefunction collapse" . I find Zeh's idea of watching the evolution of 
the eigenstates of the reduced density matrix (Schmidt paths) particularly 
appealing. This approach allows one to follow exactly the evolution of the 
correlations among subsystems. It also allows one to visualize the collapse 
process quite explicitly, as illustrated in Fig 2. However, when eigenvalues 
of the reduced density matrix are nearly degenerate, the eigenstates become 
very sensitive to "noise", and can give a misleadingly unstable picture of 
what is going on. 



I also applied the "consistent histories" analysis (of Griffiths [p0 |, Omnes 



ZI, ^2], |23| and Gell-Mann and Hartle ||24|| ) to the same system. In one 
limit, this approach can reproduce the reduced density matrix results where 
probabilities are assigned at instants in time. More generally, the consistent 
histories allow one (when the consistency conditions are satisfied) to assign 
probabilities to extended histories of the system. In the example studied 
here, many different sets of histories passed the consistency test. It is in- 
triguing that one set of consistent histories for the spin did not reflect the 
interesting evolution of the correlations between the spin and the appara- 
tus. Instead, it was more a reflection of the stability properties of the spin. 
That set of histories would look the same for a static spin, decoupled from 
the apparatus. Other consistent histories exhibited more direct links to the 
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"quantum measurement" process underway. However, there was very little 
resemblance between any given set of extended consistent histories and the 
Schmidt paths for the system. I argued that the reduced density matrix of- 
fered a more convenient point of view from which to analyze the measurement 
process. 

I have employed a perspective on wavefunction collapse which explicitly 
does not make a choice among the possible outcomes at the end of the mea- 
surement process. This results in Everett's "many worlds". An advantage of 
this perspective is that the question of what makes a good apparatus can be 
addressed quite directly. To this end, I have discussed in detail the Hamilto- 
nian used to evolve the system, and the features necessary to accomplish the 
desired evolution. Even in this primitive example, the quality of the appara- 
tus is very clearly linked with its size, and with the statistical "irreversibility" 
associated with the thermodynamic arrow of time. 33 
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A Nearly degenerate density matrices 



A.l Mathematics 

Consider the matrix: 

Its (un- normalized) eigenstates are: 

(33) 




If one takes the limit uj —>■ while keeping <5 fixed the eigenstates become 
proportional to (0,1) and (1,0). This is what the toy model is trying to 
accomplish for p2 at late times, by making the off-diagonal terms (here uj) 
small. However, if one takes 5^0 while keeping uj fixed the eigenstates 
become proportional to (±— , 1). (Since uj corresponds to (|| p2 ||), 9 in Eq 
H is just the phase of a&*(Jf;(£)|X T (£)).) 

If one wants to require the eigenstates to be close to (0, 1) and (1, 0), one 
can require: 

5 - VS 2 + uj*uj /nA . 
< e (34) 

UJ 

for some small epsilon. For small values of Eq |34] becomes 

6 > £ (35) 

This paper involves the case where uj is the overlap of two random vectors 
in a space of size n\. The magnitude of such a quantity is the net distance 
traversed by an ni-step random walk with average step size proportional 
to ^/n{ , so uj oc 1/ ^fn{ . Combining this with Eq [35], on can see that the 
minimum allowed value of S goes as 1/ ^Jn{ . 



A. 2 Physics 

The goal of the interactions was to get the wave function into the form given 
by Eqg 

|V/) = o|T> a ®|y)i + 6U>a®l^>i (36) 
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with 

(Y\Z}=0. (37) 

When a — > b the Schmidt decomposition tells us that if one insists on Eq |37| 
holding exactly (which is what Schmidt does), then the Schmidt expansion 
can look very different than Eq |36[ This is because even small non-zero values 
of (Xf(0)|Xj(0)) (= uj in Eq |32D can have a large impact on the eigenstates 
in this limit. However, how badly does the wavefunction deviate from Eq |36| 
when |(X T (0)|X i (0))| » \a - b\ (that is, when 1 » u » 8 in Eq ||)? 
One check is to look at the "overlap" between actual reduced density matrix 
for the spin (p 2 given by Eq |32|, for example) and the ideal result 

H 1/2 o + V-*)- (38) 

The quantity ir^p 2 pi is a good measure of the overlap which can be under- 
stood by writing each p in terms of its eigenstatesQ. The value of ir^fp^pj is 
unity when p 2 = pi, and is zero when no eigenstates (with non-zero eigen- 
values) overlap. Taking p 2 from Eq^2|and pj from Eq|38|, and expanding for 
small D (keeping 8 fixed) one gets 

ir^p 2 -p- 1 = l + 0(w 2 ). (39) 

At least according to this measure, p 2 and pi are very close when uj is small, 
even when their eigenstates are very different. 

One can further explore the suitability of the system as a "measurement 
apparatus" in the a — > b limit by considering the interaction of a third sys- 
tem with the apparatus. One feature of a good apparatus is a "pointer", 
which clearly exhibits the outcome of the measurement, and which can sub- 
sequently be measured by other systems to determine the outcome of the 
original measurement. (Zurek [Q] emphasizes this point by clearly partition- 
ing out the pointer from the rest of the apparatus). As discussed earlier, this 
feature is absent from the toy model. 

For the sake of discussion I will force the issue by assuming there is a 
third system which has detailed information about \X^(t)). It can use this 
information to suitably measure the apparatus. If the apparatus is found 



■'"The square root of the operator P2P1 is defined in the usual way. The operator is 
expressed in its eigenbasis and the square root of its eigenvalues are taken. 
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in |Xf(£)) the third system will conclude that the spin is up. The issue 
is being "forced" only in the sense that one is asking the third system to 
know something very complicated (namely \X^(t)), a complete "microscopic" 
state with messy time evolution) in order to use the apparatus. In a good 
apparatus some simple feature (such as a blip on a screen) should indicate 
the outcome. 

So how much of a mistake does the third system make by using this 
procedure? The errors come because the overlap of \ipf) with \X^(t)) receives 
contributions not just from the first term in Eq^(| which is indeed correlated 
with |t), but also from the second term (correlated with ||)) due to the non- 
zero value of {X±(t)\Xi(i)). To the extent that (Xi(t)\X^(t)) is small, the 
errors are small even in the a — > b limit. The size of (Xi(t)\X^(t)) simply 
represents the precision of the apparatus. 

Repeated measurements by the third system of identically prepared spin- 
apparatus systems should yield inferences of the values of a and b. These 
inferences should be increasingly good as the number of repetitions increases. 
The one "problem" encountered as a — > b is that the actual value of a — b 
falls below the precision of the apparatus. In this case the third system could 
only conclude that a « b within the experimental uncertainties. As long as 
the precision of the apparatus is acceptable, there is no problem with the 
apparatus in the a — > b limit. 

However, by choosing to look at the Schmidt decomposition, one is look- 
ing at something which can be very sensitive to a — b, as illustrated at the be- 
ginning of this appendix. In the case where the magnitude of a — b falls below 
the acceptable resolution of the apparatus, one could argue that the Schmidt 
decomposition can be very misleading. For example, the spin-apparatus sys- 
tem could be in a state sufficiently close to Eq |9] for practical purposes, 
but the Schmidt decomposition could yield something that looks completely 
different. 

This more wary attitude toward the Schmidt decomposition represents a 



step back from the enthusiasm I have expressed on other occasions (see [[18 
at the very end of section 2.2, for example). 
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B Generalized measurements and the arrow 
of time 

The point of this Appendix is to show that the final state in Eq |31] has higher 
entropy (relative to the spin-apparatus partition) than the generalized initial 
state given by Eq |30]. 

Equation |30] is manifestly in Schmidt form, and Eq |3l| can be put in 
Schmidt form by collecting even and odd terms together. For the initial 
state, the eigenvalues of the density matrix are p% and p 2 . For the final state, 
the eigenvalues are 

p T = p 1 |(T|l) 2 | 2 + P 2 |(T|2) 2 | 2 (40) 
Pi = 1-Pt- (41) 

The fact the \(A\C)\ =i (B\D)\ = is important for obtaining Eq g^. 

Since the entropy is monotonically decreasing in \pi — p 2 \, it will suffice 
to show that \p^ — p{\ < \p\ — p 2 \- Without loss of generality I take p^ > p± 
and pi > p 2 . 

PT-n = 2 PT- 1 = 2 (piKT |1) 2 | 2 + P2KT |2) 2 | 2 ) -I- (42) 

Now define: A = p\ — p 2 . Using this definition and p\ +p 2 = 1 on can rewrite 
Eq as: 

PT -PI = 2 (— !j— KT l 1 )^ 2 + I2) 2 | 2 ) " 1 (43) 

= A(l(T|l) 2 | 2 - l(T|2) 2 | 2 ) (44) 
= A(2KT|l) 2 | 2 -l) (45) 

where the normalization condition (T I T) = KT |1) 2 | 2 + |(T |2) 2 | 2 = 1 was 
used in the final step. Since (2|(f 1 1 ^> 2 1 2 — 1) is manifestly bounded above by 
unity, the desired result, |pf — pjj < \pi — p 2 \, is obtained. 

C Search technique 

Figure 1 is a "blow up" of a small portion of Fig 3. The reader might 
have noticed that the portion shown in Fig 1 is much closer to the "desired 
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behavior" than any other portion of Fig 3. This is due to the fact that 
I did a fair amount of fiddling around, trying to choose parameters which 
would make a good quantum measurement. The time range I looked at while 
searching parameter space was the same range used in Fig 1. Given this 
search "technique" , it is not surprising that my search ended on an atypical 
case. I stopped when I had found what I wanted (within the window of Fig 
1). On could say that Fig 1 is slightly misleading. On the other hand, one 
could just as well say that I understand the apparatus: I am able to prepare 
it in a suitable manner so that a good measurement is performed, and the 
record is kept for a specified period (in this case, .2 units of time). Just about 
any apparatus must be dealt with in this way. 
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Figure Captions 

Figure 1: a: The solid curve is |(f ll) 5 ] 2 , and the dashed curve gives (f| p 2 It)- 
b: The solid curve is the largest eigenvalue of p2, the dashed curve is the 
entropy of the spin. 



Figure 2: "A collapsing wavefunction." Each plot depicts an eigenstate of 
p 2 in terms of = |(j \i)\ 2 and = |(j |i)| 2 . The columns correspond 
to three different times. The two rows correspond to the two eigenstates. 
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Figure 3: The same plots as Fig 1 extended over a wider time range. 
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